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Despite the well- documented benefits of working in teams, teamwork also results in communication, 
coordination and management costs, and may lead to personal conflict between team members. In a context 
where teams play an increasingly important role, it is of major importance to understand conflict and to 
develop diagnostic tools to avert it. Here, we investigate empirically whether it is possible to quantitatively 
predict future conflict in small teams using parameter- free models of social network structure. We analyze 
data of conflict appearance and resolution between 86 team members in 16 small teams, all working in a real 
project for nine consecutive months. We find that group-based models of complex networks successfully 
anticipate conflict in small teams whereas micro-based models of structural balance, which have been 
traditionally used to model conflict, do not. 

Teamwork is increasingly important. In science, where teamwork has been best studied quantitatively thanks 
to the large amount of data available, studies have forecast the shift from individual work to teams for over a 
century 1 . Indeed, the classic studies of de Solla Price predicted that by 1980 no articles in chemistry would be 
authored by single authors 2 . Although these predictions have not come true, recent studies indicate that team- 
work has become more frequent in virtually all fields and subfields of science 3,4 . Parallel to this, there has been an 
increase in the impact of works produced by teams to the point that, today, the most highly cited works in all fields 
are overwhelmingly produced by teams 4 ' 5 . This tendency is due in part to external factors such as the increased 
complexity of cutting edge research 3 , the widespread use of new technologies 5 , the growth of the number of 
researchers, and the trend towards greater specialization 4 . There is also mounting evidence that diversity provides 
an intrinsic advantage to teams, and that teams composed by diverse individuals have higher performance than 
teams composed by similar individuals 6,7,3 . All in all, the "collective intelligence factor" of a team is a better 
predictor of team performance than the abilities of each team member 8 . 

Despite the benefits of teams, teamwork also results in communication, coordination and management 
costs 9,10 . More importantly, conflict arises in teams from tension among members. Conflict (in particular, conflict 
that is related to personal relationships) is known to interfere with team functioning and may offset the benefits of 
teamwork 6,11 " 14 . In a context where teams play an increasingly important role, it is important to understand 
conflict and to develop diagnostic tools to avert it. 

Here, we investigate empirically whether it is possible to quantitatively predict future conflict in small teams. 
Rather than using regression analysis for "conflict forensics" (that is, to explain a posteriori what factors correlate 
with higher levels of conflict in a given team) 13,15 " 17 , we focus on first-principles parameter- free models of social 
network structure, and on prediction rather than postdiction. As recently suggested 18,19 , analyzing teams as 
networks poses the methodological dilemma of choosing between "micro-level" sociopsychological theories such 
as structural balance 20 " 24 and "macro-level" theories developed in the context of network science 25 " 27,19 . We show 
that, paradoxically, statistical network methods can successfully anticipate conflict in small teams whereas some 
of the most widely-used micro-level sociological theories cannot. 

Results 

Measuring conflict and conflict evolution in small teams. Our study draws upon a long history of network 
experiments with teams and small groups, dating back to the experiments carried out in the 50's by the Group 
Networks Laboratory at MIT 28,19 . We analyze data from 16 small teams with 3 to 7 members each, for a total of 86 
team members and 374 reported within-team interactions between them (Table 1). All teams worked in the same 
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open-ended project for nine consecutive months. Importantly, the 
teams we analyze were facing a real task as opposed to a simplified 
experimental task, and their members had real incentives and 
experienced real conflicts that developed throughout the extended 
duration of the project. 

To track the development of conflict in the teams, we administered 
the same survey to all team members twice, at the middle and end of 
the project (surveys I and II, four and nine months into the project, 
respectively). In the survey we asked all individuals about their dis- 
position to work with each of the other team members in the future. 
We use the answers to this question to construct two directed net- 
works for each team in which a link from member A to member B can 
be of two types, namely, l AB = Y if the answer was positive (A is 
willing to work with B in the future) and l AB = N if the answer was 
negative (Fig. 1). We use changes in link type between the two sur- 
veys as a proxy for conflict appearance/resolution: conflict appears 
when a link of type Y in survey I becomes N in survey II; conflict 
resolves when a link of type N in survey I becomes Y in survey II. In 
what follows, we will denote interactions between team members as 
W, I 1 being the link type from survey I and F the link type from 
survey II (for example YN denotes a link where conflict arose during 
the project). Of the 374 reported interactions between individuals, 
257 were YY, 36 were YN, 30 were NY and 51 were NN (Table 1). 

Structural balance versus block model-based link reliability. In 

social network analysis, conflict evolution has traditionally been 
studied using the concept of "balance," which focuses on the state 
of network triads (or, more generally, network cycles) 20 " 24 . In a 
directed graph, a triad is in a balanced state when there is an odd 
number of positive reciprocal connections between individuals 22 ; 
otherwise, a triad is in an unbalanced state. According to this 
theory, unbalanced states produce tension and generate changes 
towards balance 20 . For example, if A and B have a positive 
relationship and so do A and C, then if B and C have a negative 
relationship (so that the number of positive reciprocal interactions 
in the triad is two) there is a tension pushing towards either the B-C 
relationship becoming positive or one of the others becoming 
negative. Since the idea of balance revolves around the relation- 
ships between small groups of individuals (in this sense we say that 
it is a "micro-level" theory), it seems a priori well-suited to study the 
evolution of conflicts in teams. 

At the other end of the spectrum of social network models, block 
models postulate that social actors can be classified into groups such 
that all actors within a group have similar patterns of interactions 



with actors in other groups 29 " 32 . These are "macro-level" models in 
which the fundamental unit of the models is the group, not the 
individual, and therefore seem a priori less well-suited to study small 
teams. However, methods based on block model inference are known 
to accurately identify reliable and unreliable interactions in large 
complex networks 32 . 

Given these considerations, we compare the ability of structural 
balance theory to predict conflicts within teams to that of a statistical 
method that uses block models to describe team interactions (Fig. 1). 

In particular, we are interested in predicting the state of each link 
l^ B in the second survey, based on the structure of the team network 
in the first survey using two methods: the structural balance method 
(SB) and the link reliability method (LR). SB focuses on the balance of 
relations induced by the presence of a positive (Y) interaction from 
member A to member B. In particular, we define the S SB score of each 
link as the difference between the number of balanced triangles £ ba i 
within the team when l AB = Y and the number of balanced triangles 
within the team when f AB = N, that is Sf B = t ha i(Y) - f ba i(N) (Fig. lc). 

In contrast, LR uses a Bayesian approach to sample over all pos- 
sible stochastic block models of a network to estimate the "reliability" 
S LR of each link, that is, the probability that the link is of type Y based 
on the observation of the whole team network obtained from survey I 
(Fig. lb and Methods) 32 ' 33 . 

Conflict prediction performance. Note that whereas the LR method 
assigns a probability for each link to become Y or N, the SB method 
does not, thus we cannot directly compare outputs from the two 
methods for each of the links. To compare both methods we 
analyze instead their ability to rank links within teams. From a 
ranking perspective, we expect that the higher the score the larger 
the probability that the link is of type Y in survey II; conversely, the 
lower the score, the larger the probability that a link is of type N in 
survey II. 

To measure the ranking accuracy in the case of conflict appear- 
ance, we take, for each team, all possible (YY, YN) link pairs and 
calculate the number of times the YY link in the pair has a higher 
score than the YN link in the pair, according to each method. 
Conversely, for conflict resolution, we record the number of times 
that the NY link has a score higher than the NN for all possible (NY, 
NN) link pairs within each team. 

For the LR method we find that YY links have higher scores than 
YN links 61% of the time (conflict appearance), and that NY links 
have scores higher than NN links 67% of the time (conflict resolu- 
tion). This means that, using the LR method, links with a lower score 
are consistently more likely to produce conflict in the future (survey 
II), both when conflict exists and when it does not exist at the time of 
survey I. In contrast, for the SB method, YY links have higher scores 
than YN links only in 47% of the cases, and NY links have scores 
higher than NN links in 55% of the cases. 

To assess the significance of these results we proceed as follows. 
For conflict appearance, we consider the ratio Uyy/^yn between the 
number of times that the score of a Wink is higher than the score of 
a YN link (nyy) and the number of times the reverse is true (u Y n)- 
Analogously, for conflict resolution we consider the ratio n NY /n NN 
between the number of times that the score of a iVY link is higher than 
the score of a iVN link (n NY ) and the number of times the the reverse is 
true (n NN ). We denote these ratios as the normalized prediction 
performance for the appearance of conflict (yiyyI m-yn) an d for the 
resolution of conflict (n NY ^NN) (Fig- 2a), respectively. We compare 
the values obtained for the SB and LR methods to those obtained by 
resampling the scores of all links, which corresponds to a null model 
in which links are not separated at all. We find that, at a 5% signifi- 
cance level, the LR method is significantly more accurate than the 
null model at predicting both the appearance (with p-value p = 
0.030) and resolution (p = 0.032) of conflicts. In contrast, the SB 
method is not (p = 0.704 andp = 0.232, respectively). 



SCIENTIFIC REPORTS | 3 : 1999 | DOI: 1 0.1 038/srep01 999 



2 



Method 



Prediction of observation 2 



Observation 1 



_Q 

CD 




CD 
O 
C 
03 
CD 
-Q 

"CD 
s_ 

O 



CO 



A B C D E F 



A C E B D F 




Likelihood t t 



Likelihood I I 



• b b 




s =1.0 



,- Y 



Link AC 



Balanced 



Unbalanced 



Balanced 



Unbalanced 



d cw We c 





B C^^=^^D C 





S SB = number of balanced triangles with link = Y - number of 
balanced triangles with link = N 




| S™>3 
S™= 2 
S 5B =1 

s SB = 0 



Figure 1 | Parameter- free network methods for conflict prediction, (a) For each team, we build a network using the information from survey I 
(Methods) . A blue link from A to B means that A would like to work with B in the future so that l AB = Y. A red link from B to A means the opposite so that 
l I BA =N. To predict which links are more likely to be Y (or N) in survey II (Methods), we apply two different methods: link reliability (LR) (b) and 
structural balance (SB) (c). (b) The LR method samples all possible partitions of nodes into groups. For each partition, it calculates the probability that 
l AB = Y according to that partition. The total probability that l AB = Y (reliability) is then a weighted sum of these probabilities over all possible partitions 
(Methods). The weight (likelihood) of a partition depends on how well it describes network connectivity. As an illustration, we show the matrix 
representation of two partitions. Each row/column corresponds to a node. Matrix elements show link types Y or N color coded in blue and red, 
respectively. The matrix on the left has a high likelihood because nodes in the same group have similar connection patterns; the matrix on the right has a 
low likelihood because nodes in the same group have different connection patterns. Finally, we use the reliability scores for each connection to obtain a 
prediction for observation 2. Link reliability values are color coded following the color bar. (c) The SB theory assumes that a balanced triad exists when 
there is an odd number of reciprocal relations. To obtain a score S SB for every link, we count the number of balanced triangles in the network t\, a \ when 
I 1 = Y minus the number of balanced triangles in the network when I 1 = N. Note that S SB only depends on triangles that include the link of interest. For 
instance, when l AC = Y, there are three balanced triangles involving /^c> while when l AC =N, there are no balanced triangles that involve Iac thus S^ c = 3. 
We use these scores to build a prediction for observation 2. Link scores are color coded following the color bar. 



Overlap between methods and hybrid scores. Although the SB 
method does not seem to consistently predict neither future conflict 
resolution nor appearance, it may still be possible that it captures 
different information from that captured by the LR method, so that 
the predictions of both methods are complementary (Fig. 2b). For 
conflict appearance, we find that the LR method accurately ranks 
n YY = 396 (YY, YZV) link pairs, whereas the SB method accurately 
ranks nyy = 305 pairs, of which 221 pairs match up in both methods. 
For conflict resolution, we find that the LR method accurately ranks 
Uny = 114 (NY, NN) link pairs, whereas the SB method accurately 
ranks n NY = 93, of which 70 pairs match up in both methods. 

Since the predictions of the SB method are not a perfect subset of 
the predictions of the LR method, it is interesting to see if a simple 
combination of both methods can provide a better prediction of 
conflict evolution than each of the two methods separately. To 



investigate this, we define a hybrid score S H that linearly combines 
the scores of both methods, S H = ocS LR + (1 -oc)S SB , where S SB is a 
properly normalized version of S SB and a is a parameter that enables 
us to interpolate between each one of the original methods 
(Methods). As we show in Fig. 3 this hybrid score does not improve, 
in general, the predictions of the LR method. For conflict appearance, 
even a small contribution of the SB score is enough to offset the 
predictive power of the LR method. That is not the case for conflict 
resolution, but in any case predictions do not significantly improve 
those of the pure LR method. 

Discussion 

Our contributions are of methodological and practical importance 
for team science. While conflict has long been recognized as one of 
the main issues in team performance, it is very hard to predict in 
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Figure 2 | Performance of parameter-free network methods for conflict prediction, (a) We show the performance of the LR (blue) and the SB (cyan) 
methods, for conflict appearance and resolution. For conflict appearance we consider the ratio between the number nyy of times that the score of a YYlink 
(positive in surveys I and II) is higher than the score of a YATlink (positive in survey I and negative in survey II) in the same team, and the number n Y N°f 
times the reverse is true. Analogously, for conflict resolution we consider the ratio between the number n NY of times that the score of a A/Ylink is higher 
than the score of a NN link, and the number n NN of times the the reverse is true. We denote these ratios as the normalized prediction performance for the 
appearance of conflict (tiyy^yn) and for the resolution of conflict (n NY /n NN ). To establish the significance of these results, we compare the values of the 
normalized prediction performance obtained for the SB and LR methods to those of the null model obtained by resampling the scores of all links within 
each team. We find that the LR method is significantly more accurate than the null model (p = 0.030 for conflict appearance and p = 0.032 for conflict 
resolution), whereas the SB method is not (p = 0.704 for conflict appearance andp = 0.232 for conflict resolution), (b) We showthe overlap of LR and SB 
methods, for conflict appearance and resolution. The numbers in the figure indicate the number of correctly ranked link pairs nyy and n NY (for conflict 
appearance and resolution, respectively) for each of the methods LR (blue) and SB (cyan), and for their overlap. 



small teams, precisely because the small size of the teams leaves us 
with little information about what factors are truly driving conflict 
dynamics. This poses a methodological challenge that we have 
addressed by investigating whether micro-based models of struc- 
tural balance or macro group -based models are more appropriate 
to tackle the problem. Our results demonstrate that it is possible 
(albeit difficult) to predict conflict in small teams. Specifically, we 
find that group-based models have more predictive power, which 
suggests that the lack of data is better addressed by the complete 
probabilistic treatment that these models make possible, than by 
the more detailed models of team dynamics. The immediate prac- 
tical implication of this finding is that, to avert conflict, groups 
can in principle be monitored in non-invasive ways (since only 
the network structure is needed, as opposed to, for example, 
detailed psychological accounts of team members). Our results 
thus highlight the relevance of the agenda put forward by Katz 



and coworkers, when they called for bringing the network per- 
spective back into team science 18 . 

Methods 

Data collection. During the academic year 2010-2011, we collected data on 
teamwork evaluation and preferences of 86 chemical engineering students that are 
grouped into teams facing an open ended project that lasts 9 months. We collected 
our data through an online survey that includes questions to evaluate different aspects 
of teamwork. We administered the same survey twice (December, survey I, and May, 
survey II). 

Our sample consists of sixteen teams with the same structure: a fourth year student 
that plays the role of team leader and first year team members; the number of team 
members for which we have complete data (that is, that reported in both surveys I and 
II) ranges from 3 to 7, with most teams having 5-6 members (Table 1). Team 
membership was determined as follows. First, individuals were randomly assigned to 
one of four large groups. From each of these groups, four teams were defined so as to 
balance personality traits of their members (based on a personality test) but otherwise 
randomly. 
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Figure 3 | Hybrid scores for conflict prediction. We introduce a hybrid score (S H ) obtained from the linear combination of the scores of both methods, 
S LR and S SB (Text and Methods). We plot the normalized prediction performance of the hybrid score for conflict appearance, (a), and conflict resolution, 
(b), as a function of a parameter a e [0, 1] that enables us to interpolate between S H (oc = 0) = S SB , and S H (ot = 1) = S LR . 
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In our analysis, we focus in the answers of two yes/no questions from the survey: 
1) "Would you choose this person to work with you in a new team?"; 2) "Would you 
choose this leader to lead a new team?". We use the answer to these questions as a 
proxy for the quality of interactions among team members. Thus, a yes answer would 
be a positive interaction whereas a no answer would be indicative of conflict among 
the pair of team members. 

With this information we construct a directed network for each one of the surveys I 
and II, in which the link from member A to member B can be of two types l AB = Y or 
l AB = N. We only consider interactions between pairs of team members for which we 
have complete information, that is, when both members have answered both surveys. 

Conflict prediction methods. Link reliability score. We have extended the formalism 
developed in 32 to obtain the link reliability score S LR , that is, the probability that a link 
from member A to member B is of type Y in survey II, l AB = Y, given the observation 
N° of all interactions reported in survey I. 

The fundamental assumption of this approach is that the structure of the 
network of interactions within a team can be satisfactorily accounted for by a model 
M y which is unknown but belongs to a family AA of models. Then, the probability 
that a link from member A to member B is of type Y, l AB = Y, given the observed 
network N° is 32 

S LR =P(1'L = Y\N°)= f dMp(l» B = Y\M)p(M\N°), (1) 
J M 

To estimate this integral we rewrite it, using Bayes theorem, as 32 ' 33 

f dMp(l^ = Y\M)p(N°\M)p(M) 

jO\ _ J_M 

/ dMp(N°\M)p(M) 
J M 



S LR =p(l'{ B = Y\N°) = 



(2) 



Here, p(N°\M) is the probability of the observed interactions given a model andp(M) 
is the a priori probability of a model, which we assume to be model-independent p(M) 
= const. 

For the family of stochastic block models, we have that for a given partition of team 
members into groups, there is a probability Q(oc, /?) of there being a link of type Y from 
a member in group a to a member in group and a probability ( 1 — Q(a, /?)) of there 
being a link of type N. Note that because we are dealing with a directed network, 
Q(a, /?) is not a symmetric matrix since for each block model team member A will be 
classified into two groups: a group for the outgoing links profile (a A out ), and a group 
for the incoming links profile (a A in ). Thus, if A belongs group a for outgoing links and 
B to group /? for incoming links, we have that 33 

p{l n AB = Y\M)=Q(x,P); (3) 
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where n Y/N {ct, /?) is the number of links of type Y/N between member groups a and 
and G out/in is the set of groups for outgoing/incoming link profiles in block model M. 
Additionally, the integral over all models in M can be separated into a sum over all 
possible partitions of the members into outgoing and incoming link groups, and an 
integral over all possible values of Q(a, /?). These integrals can be carried out exactly to 
get 32 - 33 



* p 
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• ffB '" ) + iV P (-H(P)), (5) 



+27 



where the sum is over all partitions of the team members into outgoing and incoming 
link groups, n{o A out , o B in ) = Dt : {y,n} ^ t (o"a out> &b in) is the total number of known 
interactions from groups o A out and o B in , and H(P) is a function that depends on the 
partition only 



ln(n(a,j8) + l)!- ^ ln(n r (a,jS))! 

T:{Y,N} 



(6) 



The sum in Eq. (5) can be estimated using the Metropolis algorithm to sample 
partitions 32 . 

Structural balance score. To obtain S S AB we look at all the possible triads of members in 
a team that include members A and B. Then, we count the number of balanced triads 
t\>ai(Y) when l AB = Y, and the number of balanced triads t h3L \(N) when l AB = N. We 
then obtain 



s <bd(y)-w(N). 



(7) 



According to structural balance theory, a balanced triad is one in which there is an 
odd number of positive reciprocal interactions. A positive reciprocal interaction is 
one such that l AB = l BA = Y. 

For all the graphs and discussions in the main text we use the definition 
above for the structural balance-based score. One may argue, however, that this 
definition is somewhat restrictive because if an interaction is not reciprocal to start 
with, each of the nodes can only improve overall balance by switching, but never by 



staying in the same state. Therefore, we also consider here a second structural balance 
score Sff 



S S AB = = f bal (Ub = IbA = Y) — ffeal (Ub = IbA = N) . 



(8) 



that is, the difference between the number of balanced triangles when both links AB 
and BA are positive and the number of balanced triangles when both links AB and BA 
are negative. As we show in Figure SI (Supplementary Information), this definition 
does not yield higher predictive power than the one discussed in the main text. 

Hybrid scores. For each link AB, we obtain a hybrid score S AB by combining LR and SB 
sores. However, because S LR is normalized, S LR e [0, 1] , and S SB is not, we first need to 
normalize S SB . For each link AB within team T, we obtain the normalized SB score S S A B B 
as follows 



" cSB _cSB ' 
Tmax Tmin 



(9) 



where S^ min and S f? max are the minimum and maximum scores in team T, 
respectively. 

We then obtain a hybrid score for each link S AB by linearly combining Sj^ and S S A B B , 

s&=«sS+(i-«)sX> (io) 

where a e [0, 1] is the parameter that allows us to interpolate between SB (a = 0) and 
LR (a = 1) score rankings. 
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